Dataset statistics
| Number of variables | 25 |
|---|---|
| Number of observations | 79293 |
| Missing cells | 220864 |
| Missing cells (%) | 11.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 15.1 MiB |
| Average record size in memory | 200.0 B |
Variable types
| Numeric | 14 |
|---|---|
| Categorical | 9 |
| Boolean | 2 |
event_date has a high cardinality: 12638 distinct values | High cardinality |
location has a high cardinality: 25264 distinct values | High cardinality |
injury_severity has a high cardinality: 124 distinct values | High cardinality |
make has a high cardinality: 6707 distinct values | High cardinality |
model has a high cardinality: 11330 distinct values | High cardinality |
Unnamed: 0 is highly correlated with Year and 1 other fields | High correlation |
total_fatal_injuries is highly correlated with injuries | High correlation |
total_serious_injuries is highly correlated with injuries | High correlation |
total_minor_injuries is highly correlated with injuries | High correlation |
total_uninjured is highly correlated with pax_onboard and 1 other fields | High correlation |
Year is highly correlated with Unnamed: 0 and 1 other fields | High correlation |
injuries is highly correlated with total_fatal_injuries and 2 other fields | High correlation |
pax_onboard is highly correlated with total_uninjured and 1 other fields | High correlation |
survived is highly correlated with total_uninjured and 1 other fields | High correlation |
df_index is highly correlated with Unnamed: 0 and 1 other fields | High correlation |
Unnamed: 0 is highly correlated with Year and 1 other fields | High correlation |
total_fatal_injuries is highly correlated with injuries and 2 other fields | High correlation |
total_minor_injuries is highly correlated with injuries | High correlation |
total_uninjured is highly correlated with injuries and 2 other fields | High correlation |
Year is highly correlated with Unnamed: 0 and 1 other fields | High correlation |
injuries is highly correlated with total_fatal_injuries and 3 other fields | High correlation |
pax_onboard is highly correlated with survived | High correlation |
fatality_percentage is highly correlated with total_fatal_injuries and 3 other fields | High correlation |
survived is highly correlated with total_fatal_injuries and 3 other fields | High correlation |
df_index is highly correlated with Unnamed: 0 and 1 other fields | High correlation |
Unnamed: 0 is highly correlated with Year and 1 other fields | High correlation |
total_fatal_injuries is highly correlated with injuries and 2 other fields | High correlation |
total_uninjured is highly correlated with injuries and 1 other fields | High correlation |
Year is highly correlated with Unnamed: 0 and 1 other fields | High correlation |
injuries is highly correlated with total_fatal_injuries and 2 other fields | High correlation |
pax_onboard is highly correlated with survived | High correlation |
fatality_percentage is highly correlated with total_fatal_injuries and 2 other fields | High correlation |
survived is highly correlated with total_fatal_injuries and 3 other fields | High correlation |
df_index is highly correlated with Unnamed: 0 and 1 other fields | High correlation |
Unnamed: 0 is highly correlated with df_index and 1 other fields | High correlation |
amateur_build is highly correlated with AmateurBuilt | High correlation |
aircraft_damage is highly correlated with weather_conditions and 2 other fields | High correlation |
total_serious_injuries is highly correlated with injuries and 2 other fields | High correlation |
pax_onboard is highly correlated with engine_type and 2 other fields | High correlation |
df_index is highly correlated with Unnamed: 0 and 1 other fields | High correlation |
number_of_engines is highly correlated with engine_type | High correlation |
injuries is highly correlated with total_serious_injuries and 2 other fields | High correlation |
total_fatal_injuries is highly correlated with total_serious_injuries and 1 other fields | High correlation |
engine_type is highly correlated with pax_onboard and 2 other fields | High correlation |
weather_conditions is highly correlated with aircraft_damage | High correlation |
Year is highly correlated with Unnamed: 0 and 1 other fields | High correlation |
fatality_percentage is highly correlated with aircraft_damage | High correlation |
total_minor_injuries is highly correlated with total_serious_injuries and 1 other fields | High correlation |
survived is highly correlated with pax_onboard and 2 other fields | High correlation |
AmateurBuilt is highly correlated with amateur_build | High correlation |
phase_of_flight is highly correlated with aircraft_damage | High correlation |
total_uninjured is highly correlated with pax_onboard and 1 other fields | High correlation |
amateur_build is highly correlated with AmateurBuilt | High correlation |
AmateurBuilt is highly correlated with amateur_build | High correlation |
aircraft_damage has 2410 (3.0%) missing values | Missing |
number_of_engines has 3986 (5.0%) missing values | Missing |
engine_type has 3374 (4.3%) missing values | Missing |
total_fatal_injuries has 23309 (29.4%) missing values | Missing |
total_serious_injuries has 25551 (32.2%) missing values | Missing |
total_minor_injuries has 24460 (30.8%) missing values | Missing |
total_uninjured has 12344 (15.6%) missing values | Missing |
phase_of_flight has 6054 (7.6%) missing values | Missing |
injuries has 29557 (37.3%) missing values | Missing |
pax_onboard has 29671 (37.4%) missing values | Missing |
fatality_percentage has 29698 (37.5%) missing values | Missing |
survived has 29671 (37.4%) missing values | Missing |
total_fatal_injuries is highly skewed (γ1 = 29.51903889) | Skewed |
total_serious_injuries is highly skewed (γ1 = 37.45361961) | Skewed |
total_minor_injuries is highly skewed (γ1 = 66.84890696) | Skewed |
injuries is highly skewed (γ1 = 30.77744311) | Skewed |
Unnamed: 0 is uniformly distributed | Uniform |
df_index is uniformly distributed | Uniform |
Unnamed: 0 has unique values | Unique |
df_index has unique values | Unique |
number_of_engines has 1275 (1.6%) zeros | Zeros |
total_fatal_injuries has 40092 (50.6%) zeros | Zeros |
total_serious_injuries has 42660 (53.8%) zeros | Zeros |
total_minor_injuries has 40064 (50.5%) zeros | Zeros |
total_uninjured has 19126 (24.1%) zeros | Zeros |
injuries has 26956 (34.0%) zeros | Zeros |
fatality_percentage has 40061 (50.5%) zeros | Zeros |
survived has 7829 (9.9%) zeros | Zeros |
Reproduction
| Analysis started | 2021-09-06 20:05:28.358153 |
|---|---|
| Analysis finished | 2021-09-06 20:05:54.773251 |
| Duration | 26.42 seconds |
| Software version | pandas-profiling v3.0.0 |
| Download configuration | config.json |
Unnamed: 0
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONUNIFORMUNIQUE| Distinct | 79293 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 39646 |
| Minimum | 0 |
|---|---|
| Maximum | 79292 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 619.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3964.6 |
| Q1 | 19823 |
| median | 39646 |
| Q3 | 59469 |
| 95-th percentile | 75327.4 |
| Maximum | 79292 |
| Range | 79292 |
| Interquartile range (IQR) | 39646 |
Descriptive statistics
| Standard deviation | 22890.06178 |
|---|---|
| Coefficient of variation (CV) | 0.5773611912 |
| Kurtosis | -1.2 |
| Mean | 39646 |
| Median Absolute Deviation (MAD) | 19823 |
| Skewness | -2.103152104 × 10-17 |
| Sum | 3143650278 |
| Variance | 523954928.5 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2047 | 1 | < 0.1% |
| 4759 | 1 | < 0.1% |
| 10896 | 1 | < 0.1% |
| 8849 | 1 | < 0.1% |
| 14994 | 1 | < 0.1% |
| 12947 | 1 | < 0.1% |
| 2708 | 1 | < 0.1% |
| 661 | 1 | < 0.1% |
| 6806 | 1 | < 0.1% |
| 27288 | 1 | < 0.1% |
| Other values (79283) | 79283 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 |
| Value | Count | Frequency (%) |
| 79292 | 1 | |
| 79291 | 1 | |
| 79290 | 1 | |
| 79289 | 1 | |
| 79288 | 1 | |
| 79287 | 1 | |
| 79286 | 1 | |
| 79285 | 1 | |
| 79284 | 1 | |
| 79283 | 1 |
| Distinct | 12638 |
|---|---|
| Distinct (%) | 15.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 619.6 KiB |
| 1982-05-16 | 25 |
|---|---|
| 1984-06-30 | 25 |
| 2000-07-08 | 25 |
| 1983-06-05 | 24 |
| 1983-08-05 | 24 |
| Other values (12633) |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 792930 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 542 ? |
|---|---|
| Unique (%) | 0.7% |
Sample
| 1st row | 1982-06-13 |
|---|---|
| 2nd row | 1982-07-01 |
| 3rd row | 1982-07-16 |
| 4th row | 1982-08-21 |
| 5th row | 1982-08-24 |
Common Values
| Value | Count | Frequency (%) |
| 1982-05-16 | 25 | < 0.1% |
| 1984-06-30 | 25 | < 0.1% |
| 2000-07-08 | 25 | < 0.1% |
| 1983-06-05 | 24 | < 0.1% |
| 1983-08-05 | 24 | < 0.1% |
| 1986-05-17 | 24 | < 0.1% |
| 1984-08-25 | 24 | < 0.1% |
| 1983-05-28 | 23 | < 0.1% |
| 1988-08-07 | 23 | < 0.1% |
| 1982-10-03 | 23 | < 0.1% |
| Other values (12628) | 79053 |
Length
| Value | Count | Frequency (%) |
| 1982-05-16 | 25 | < 0.1% |
| 1984-06-30 | 25 | < 0.1% |
| 2000-07-08 | 25 | < 0.1% |
| 1983-06-05 | 24 | < 0.1% |
| 1983-08-05 | 24 | < 0.1% |
| 1986-05-17 | 24 | < 0.1% |
| 1984-08-25 | 24 | < 0.1% |
| 1983-05-28 | 23 | < 0.1% |
| 1988-08-07 | 23 | < 0.1% |
| 1982-10-03 | 23 | < 0.1% |
| Other values (12628) | 79053 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 159256 | |
| - | 158586 | |
| 1 | 126056 | |
| 9 | 92245 | |
| 2 | 84143 | |
| 8 | 48463 | 6.1% |
| 3 | 27198 | 3.4% |
| 6 | 24806 | 3.1% |
| 7 | 24363 | 3.1% |
| 5 | 24355 | 3.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 634344 | |
| Dash Punctuation | 158586 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 159256 | |
| 1 | 126056 | |
| 9 | 92245 | |
| 2 | 84143 | |
| 8 | 48463 | 7.6% |
| 3 | 27198 | 4.3% |
| 6 | 24806 | 3.9% |
| 7 | 24363 | 3.8% |
| 5 | 24355 | 3.8% |
| 4 | 23459 | 3.7% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 158586 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 792930 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 159256 | |
| - | 158586 | |
| 1 | 126056 | |
| 9 | 92245 | |
| 2 | 84143 | |
| 8 | 48463 | 6.1% |
| 3 | 27198 | 3.4% |
| 6 | 24806 | 3.1% |
| 7 | 24363 | 3.1% |
| 5 | 24355 | 3.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 792930 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 159256 | |
| - | 158586 | |
| 1 | 126056 | |
| 9 | 92245 | |
| 2 | 84143 | |
| 8 | 48463 | 6.1% |
| 3 | 27198 | 3.4% |
| 6 | 24806 | 3.1% |
| 7 | 24363 | 3.1% |
| 5 | 24355 | 3.1% |
| Distinct | 25264 |
|---|---|
| Distinct (%) | 31.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 619.6 KiB |
| ANCHORAGE, AK | 372 |
|---|---|
| MIAMI, FL | 185 |
| CHICAGO, IL | 169 |
| ALBUQUERQUE, NM | 165 |
| HOUSTON, TX | 155 |
| Other values (25259) |
Length
| Max length | 61 |
|---|---|
| Median length | 12 |
| Mean length | 12.93819127 |
| Min length | 4 |
Characters and Unicode
| Total characters | 1025908 |
|---|---|
| Distinct characters | 82 |
| Distinct categories | 12 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 14896 ? |
|---|---|
| Unique (%) | 18.8% |
Sample
| 1st row | CAMBRIA, NY |
|---|---|
| 2nd row | MCWHORTER, KY |
| 3rd row | FREDERICK, MD |
| 4th row | VENTURA, CA |
| 5th row | SIDNEY, NE |
Common Values
| Value | Count | Frequency (%) |
| ANCHORAGE, AK | 372 | 0.5% |
| MIAMI, FL | 185 | 0.2% |
| CHICAGO, IL | 169 | 0.2% |
| ALBUQUERQUE, NM | 165 | 0.2% |
| HOUSTON, TX | 155 | 0.2% |
| Anchorage, AK | 140 | 0.2% |
| FAIRBANKS, AK | 138 | 0.2% |
| ORLANDO, FL | 114 | 0.1% |
| TUCSON, AZ | 107 | 0.1% |
| ENGLEWOOD, CO | 107 | 0.1% |
| Other values (25254) | 77641 |
Length
| Value | Count | Frequency (%) |
| ca | 8179 | 4.5% |
| tx | 5265 | 2.9% |
| fl | 5228 | 2.9% |
| ak | 5171 | 2.9% |
| az | 2554 | 1.4% |
| co | 2508 | 1.4% |
| wa | 2394 | 1.3% |
| il | 1914 | 1.1% |
| mi | 1896 | 1.0% |
| city | 1883 | 1.0% |
| Other values (12575) | 143931 |
Most occurring characters
| Value | Count | Frequency (%) |
| 101630 | 9.9% | |
| , | 79108 | 7.7% |
| A | 74922 | 7.3% |
| N | 47000 | 4.6% |
| L | 45800 | 4.5% |
| E | 44628 | 4.4% |
| O | 41931 | 4.1% |
| I | 36346 | 3.5% |
| T | 34118 | 3.3% |
| R | 33380 | 3.3% |
| Other values (72) | 487045 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 609663 | |
| Lowercase Letter | 233302 | 22.7% |
| Space Separator | 101630 | 9.9% |
| Other Punctuation | 80593 | 7.9% |
| Decimal Number | 464 | < 0.1% |
| Dash Punctuation | 216 | < 0.1% |
| Open Punctuation | 12 | < 0.1% |
| Close Punctuation | 12 | < 0.1% |
| Format | 9 | < 0.1% |
| Control | 5 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 74922 | 12.3% |
| N | 47000 | 7.7% |
| L | 45800 | 7.5% |
| E | 44628 | 7.3% |
| O | 41931 | 6.9% |
| I | 36346 | 6.0% |
| T | 34118 | 5.6% |
| R | 33380 | 5.5% |
| C | 33249 | 5.5% |
| S | 30350 | 5.0% |
| Other values (18) | 187939 |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 29434 | |
| e | 25946 | |
| n | 21525 | |
| o | 19646 | 8.4% |
| l | 17958 | 7.7% |
| i | 17473 | 7.5% |
| r | 17236 | 7.4% |
| t | 13286 | 5.7% |
| s | 11330 | 4.9% |
| d | 7736 | 3.3% |
| Other values (16) | 51732 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 83 | |
| 0 | 61 | |
| 2 | 60 | |
| 5 | 53 | |
| 3 | 45 | |
| 4 | 37 | |
| 7 | 34 | |
| 6 | 33 | 7.1% |
| 8 | 31 | 6.7% |
| 9 | 27 | 5.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 79108 | |
| . | 1211 | 1.5% |
| ' | 166 | 0.2% |
| ? | 73 | 0.1% |
| / | 29 | < 0.1% |
| # | 3 | < 0.1% |
| § | 2 | < 0.1% |
| & | 1 | < 0.1% |
Control
| Value | Count | Frequency (%) |
| | 2 | |
| | 2 | |
| | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 101630 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 216 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 12 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 12 |
Format
| Value | Count | Frequency (%) |
| | 9 |
Math Symbol
| Value | Count | Frequency (%) |
| ± | 1 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 842965 | |
| Common | 182943 | 17.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 74922 | 8.9% |
| N | 47000 | 5.6% |
| L | 45800 | 5.4% |
| E | 44628 | 5.3% |
| O | 41931 | 5.0% |
| I | 36346 | 4.3% |
| T | 34118 | 4.0% |
| R | 33380 | 4.0% |
| C | 33249 | 3.9% |
| S | 30350 | 3.6% |
| Other values (44) | 421241 |
Common
| Value | Count | Frequency (%) |
| 101630 | ||
| , | 79108 | |
| . | 1211 | 0.7% |
| - | 216 | 0.1% |
| ' | 166 | 0.1% |
| 1 | 83 | < 0.1% |
| ? | 73 | < 0.1% |
| 0 | 61 | < 0.1% |
| 2 | 60 | < 0.1% |
| 5 | 53 | < 0.1% |
| Other values (18) | 282 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1025878 | |
| Latin 1 Sup | 30 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 101630 | 9.9% | |
| , | 79108 | 7.7% |
| A | 74922 | 7.3% |
| N | 47000 | 4.6% |
| L | 45800 | 4.5% |
| E | 44628 | 4.4% |
| O | 41931 | 4.1% |
| I | 36346 | 3.5% |
| T | 34118 | 3.3% |
| R | 33380 | 3.3% |
| Other values (65) | 487015 |
Latin 1 Sup
| Value | Count | Frequency (%) |
| Â | 14 | |
| | 9 | |
| § | 2 | 6.7% |
| | 2 | 6.7% |
| Ã | 1 | 3.3% |
| ± | 1 | 3.3% |
| | 1 | 3.3% |
| Distinct | 124 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 619.6 KiB |
| Non-Fatal | |
|---|---|
| Fatal(1) | |
| Fatal(2) | 4618 |
| Incident | 3175 |
| Fatal(3) | 1450 |
| Other values (119) | 2195 |
Length
| Max length | 11 |
|---|---|
| Median length | 9 |
| Mean length | 8.769513072 |
| Min length | 8 |
Characters and Unicode
| Total characters | 695361 |
|---|---|
| Distinct characters | 28 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 67 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Non-Fatal |
|---|---|
| 2nd row | Non-Fatal |
| 3rd row | Fatal(1) |
| 4th row | Non-Fatal |
| 5th row | Non-Fatal |
Common Values
| Value | Count | Frequency (%) |
| Non-Fatal | 60025 | |
| Fatal(1) | 7830 | 9.9% |
| Fatal(2) | 4618 | 5.8% |
| Incident | 3175 | 4.0% |
| Fatal(3) | 1450 | 1.8% |
| Fatal(4) | 1012 | 1.3% |
| Fatal(5) | 311 | 0.4% |
| Unavailable | 220 | 0.3% |
| Fatal(6) | 196 | 0.2% |
| Fatal(7) | 83 | 0.1% |
| Other values (114) | 373 | 0.5% |
Length
| Value | Count | Frequency (%) |
| non-fatal | 60025 | |
| fatal(1 | 7830 | 9.9% |
| fatal(2 | 4618 | 5.8% |
| incident | 3175 | 4.0% |
| fatal(3 | 1450 | 1.8% |
| fatal(4 | 1012 | 1.3% |
| fatal(5 | 311 | 0.4% |
| unavailable | 220 | 0.3% |
| fatal(6 | 196 | 0.2% |
| fatal(7 | 83 | 0.1% |
| Other values (114) | 373 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 152456 | |
| t | 79073 | |
| l | 76338 | |
| F | 75898 | |
| n | 66595 | |
| N | 60025 | 8.6% |
| o | 60025 | 8.6% |
| - | 60025 | 8.6% |
| ( | 15873 | 2.3% |
| ) | 15873 | 2.3% |
| Other values (18) | 33180 | 4.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 448067 | |
| Uppercase Letter | 139318 | 20.0% |
| Dash Punctuation | 60025 | 8.6% |
| Decimal Number | 16205 | 2.3% |
| Open Punctuation | 15873 | 2.3% |
| Close Punctuation | 15873 | 2.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 152456 | |
| t | 79073 | |
| l | 76338 | |
| n | 66595 | |
| o | 60025 | 13.4% |
| i | 3395 | 0.8% |
| e | 3395 | 0.8% |
| c | 3175 | 0.7% |
| d | 3175 | 0.7% |
| v | 220 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 8034 | |
| 2 | 4692 | |
| 3 | 1498 | 9.2% |
| 4 | 1064 | 6.6% |
| 5 | 354 | 2.2% |
| 6 | 222 | 1.4% |
| 7 | 115 | 0.7% |
| 8 | 94 | 0.6% |
| 0 | 71 | 0.4% |
| 9 | 61 | 0.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 75898 | |
| N | 60025 | |
| I | 3175 | 2.3% |
| U | 220 | 0.2% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 60025 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 15873 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 15873 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 587385 | |
| Common | 107976 | 15.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 152456 | |
| t | 79073 | |
| l | 76338 | |
| F | 75898 | |
| n | 66595 | |
| N | 60025 | 10.2% |
| o | 60025 | 10.2% |
| i | 3395 | 0.6% |
| e | 3395 | 0.6% |
| I | 3175 | 0.5% |
| Other values (5) | 7010 | 1.2% |
Common
| Value | Count | Frequency (%) |
| - | 60025 | |
| ( | 15873 | 14.7% |
| ) | 15873 | 14.7% |
| 1 | 8034 | 7.4% |
| 2 | 4692 | 4.3% |
| 3 | 1498 | 1.4% |
| 4 | 1064 | 1.0% |
| 5 | 354 | 0.3% |
| 6 | 222 | 0.2% |
| 7 | 115 | 0.1% |
| Other values (3) | 226 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 695361 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 152456 | |
| t | 79073 | |
| l | 76338 | |
| F | 75898 | |
| n | 66595 | |
| N | 60025 | 8.6% |
| o | 60025 | 8.6% |
| - | 60025 | 8.6% |
| ( | 15873 | 2.3% |
| ) | 15873 | 2.3% |
| Other values (18) | 33180 | 4.8% |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2410 |
| Missing (%) | 3.0% |
| Memory size | 619.6 KiB |
| Substantial | |
|---|---|
| Destroyed | |
| Minor | 2512 |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 10.3533551 |
| Min length | 5 |
Characters and Unicode
| Total characters | 795997 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Destroyed |
|---|---|
| 2nd row | Destroyed |
| 3rd row | Destroyed |
| 4th row | Destroyed |
| 5th row | Substantial |
Common Values
| Value | Count | Frequency (%) |
| Substantial | 57049 | |
| Destroyed | 17322 | 21.8% |
| Minor | 2512 | 3.2% |
| (Missing) | 2410 | 3.0% |
Length
Pie chart
| Value | Count | Frequency (%) |
| substantial | 57049 | |
| destroyed | 17322 | 22.5% |
| minor | 2512 | 3.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 131420 | |
| a | 114098 | |
| s | 74371 | |
| n | 59561 | |
| i | 59561 | |
| S | 57049 | |
| u | 57049 | |
| b | 57049 | |
| l | 57049 | |
| e | 34644 | 4.4% |
| Other values (6) | 94146 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 719114 | |
| Uppercase Letter | 76883 | 9.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 131420 | |
| a | 114098 | |
| s | 74371 | |
| n | 59561 | |
| i | 59561 | |
| u | 57049 | |
| b | 57049 | |
| l | 57049 | |
| e | 34644 | 4.8% |
| r | 19834 | 2.8% |
| Other values (3) | 54478 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 57049 | |
| D | 17322 | 22.5% |
| M | 2512 | 3.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 795997 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 131420 | |
| a | 114098 | |
| s | 74371 | |
| n | 59561 | |
| i | 59561 | |
| S | 57049 | |
| u | 57049 | |
| b | 57049 | |
| l | 57049 | |
| e | 34644 | 4.4% |
| Other values (6) | 94146 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 795997 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 131420 | |
| a | 114098 | |
| s | 74371 | |
| n | 59561 | |
| i | 59561 | |
| S | 57049 | |
| u | 57049 | |
| b | 57049 | |
| l | 57049 | |
| e | 34644 | 4.4% |
| Other values (6) | 94146 |
| Distinct | 6707 |
|---|---|
| Distinct (%) | 8.5% |
| Missing | 89 |
| Missing (%) | 0.1% |
| Memory size | 619.6 KiB |
| CESSNA | |
|---|---|
| PIPER | |
| BEECH | |
| BELL | 2467 |
| BOEING | 2153 |
| Other values (6702) |
Length
| Max length | 33 |
|---|---|
| Median length | 6 |
| Mean length | 7.494432099 |
| Min length | 2 |
Characters and Unicode
| Total characters | 593589 |
|---|---|
| Distinct characters | 47 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 5425 ? |
|---|---|
| Unique (%) | 6.8% |
Sample
| 1st row | 107.5 FLYING CORPORATION |
|---|---|
| 2nd row | 1200 |
| 3rd row | 177MF LLC |
| 4th row | 1977 COLFER-CHAN |
| 5th row | 1ST FTR GP |
Common Values
| Value | Count | Frequency (%) |
| CESSNA | 24847 | |
| PIPER | 13529 | |
| BEECH | 4881 | 6.2% |
| BELL | 2467 | 3.1% |
| BOEING | 2153 | 2.7% |
| MOONEY | 1209 | 1.5% |
| GRUMMAN | 1126 | 1.4% |
| ROBINSON | 1005 | 1.3% |
| BELLANCA | 987 | 1.2% |
| HUGHES | 881 | 1.1% |
| Other values (6697) | 26119 |
Length
| Value | Count | Frequency (%) |
| cessna | 24888 | |
| piper | 13569 | 14.3% |
| beech | 4889 | 5.2% |
| bell | 2514 | 2.7% |
| boeing | 2231 | 2.4% |
| grumman | 1456 | 1.5% |
| robinson | 1305 | 1.4% |
| mooney | 1249 | 1.3% |
| bellanca | 989 | 1.0% |
| american | 956 | 1.0% |
| Other values (5893) | 40779 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 82988 | |
| S | 66877 | |
| A | 56791 | |
| N | 50502 | 8.5% |
| C | 47336 | 8.0% |
| R | 44285 | 7.5% |
| I | 35920 | 6.1% |
| P | 32692 | 5.5% |
| O | 26516 | 4.5% |
| L | 22719 | 3.8% |
| Other values (37) | 126963 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 573692 | |
| Space Separator | 15621 | 2.6% |
| Other Punctuation | 2496 | 0.4% |
| Dash Punctuation | 989 | 0.2% |
| Open Punctuation | 338 | 0.1% |
| Close Punctuation | 336 | 0.1% |
| Decimal Number | 112 | < 0.1% |
| Math Symbol | 5 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 82988 | |
| S | 66877 | |
| A | 56791 | |
| N | 50502 | |
| C | 47336 | |
| R | 44285 | |
| I | 35920 | 6.3% |
| P | 32692 | 5.7% |
| O | 26516 | 4.6% |
| L | 22719 | 4.0% |
| Other values (16) | 107066 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 20 | |
| 0 | 17 | |
| 7 | 17 | |
| 2 | 13 | |
| 5 | 10 | |
| 3 | 9 | |
| 6 | 9 | |
| 8 | 7 | 6.2% |
| 4 | 6 | 5.4% |
| 9 | 4 | 3.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1564 | |
| , | 321 | 12.9% |
| / | 283 | 11.3% |
| & | 271 | 10.9% |
| ' | 35 | 1.4% |
| ? | 22 | 0.9% |
Space Separator
| Value | Count | Frequency (%) |
| 15621 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 989 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 338 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 336 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 573692 | |
| Common | 19897 | 3.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 82988 | |
| S | 66877 | |
| A | 56791 | |
| N | 50502 | |
| C | 47336 | |
| R | 44285 | |
| I | 35920 | 6.3% |
| P | 32692 | 5.7% |
| O | 26516 | 4.6% |
| L | 22719 | 4.0% |
| Other values (16) | 107066 |
Common
| Value | Count | Frequency (%) |
| 15621 | ||
| . | 1564 | 7.9% |
| - | 989 | 5.0% |
| ( | 338 | 1.7% |
| ) | 336 | 1.7% |
| , | 321 | 1.6% |
| / | 283 | 1.4% |
| & | 271 | 1.4% |
| ' | 35 | 0.2% |
| ? | 22 | 0.1% |
| Other values (11) | 117 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 593589 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 82988 | |
| S | 66877 | |
| A | 56791 | |
| N | 50502 | 8.5% |
| C | 47336 | 8.0% |
| R | 44285 | 7.5% |
| I | 35920 | 6.1% |
| P | 32692 | 5.5% |
| O | 26516 | 4.5% |
| L | 22719 | 3.8% |
| Other values (37) | 126963 |
| Distinct | 11330 |
|---|---|
| Distinct (%) | 14.3% |
| Missing | 118 |
| Missing (%) | 0.1% |
| Memory size | 619.6 KiB |
| 152 | 2278 |
|---|---|
| 172 | 1263 |
| 172N | 1133 |
| PA-28-140 | 900 |
| 172M | 775 |
| Other values (11325) |
Length
| Max length | 20 |
|---|---|
| Median length | 5 |
| Mean length | 5.828215977 |
| Min length | 1 |
Characters and Unicode
| Total characters | 461449 |
|---|---|
| Distinct characters | 82 |
| Distinct categories | 10 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 7068 ? |
|---|---|
| Unique (%) | 8.9% |
Sample
| 1st row | 64 |
|---|---|
| 2nd row | KR-2 |
| 3rd row | WINDWAGON |
| 4th row | MIDGET MUSTANG |
| 5th row | SKYBOLT |
Common Values
| Value | Count | Frequency (%) |
| 152 | 2278 | 2.9% |
| 172 | 1263 | 1.6% |
| 172N | 1133 | 1.4% |
| PA-28-140 | 900 | 1.1% |
| 172M | 775 | 1.0% |
| 150 | 725 | 0.9% |
| 172P | 669 | 0.8% |
| 150M | 581 | 0.7% |
| PA-18 | 573 | 0.7% |
| PA-28-161 | 558 | 0.7% |
| Other values (11320) | 69720 |
Length
| Value | Count | Frequency (%) |
| 152 | 2303 | 2.5% |
| 172 | 1329 | 1.5% |
| 172n | 1135 | 1.2% |
| ii | 933 | 1.0% |
| pa-28-140 | 901 | 1.0% |
| 172m | 775 | 0.9% |
| 150 | 758 | 0.8% |
| 172p | 672 | 0.7% |
| 150m | 581 | 0.6% |
| pa-18 | 577 | 0.6% |
| Other values (8845) | 80852 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 45859 | 9.9% |
| 2 | 45323 | 9.8% |
| - | 43474 | 9.4% |
| 0 | 33868 | 7.3% |
| A | 31738 | 6.9% |
| 5 | 19580 | 4.2% |
| 8 | 18527 | 4.0% |
| 3 | 18109 | 3.9% |
| P | 17213 | 3.7% |
| 7 | 17205 | 3.7% |
| Other values (72) | 170553 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 223981 | |
| Uppercase Letter | 166764 | |
| Dash Punctuation | 43474 | 9.4% |
| Lowercase Letter | 14723 | 3.2% |
| Space Separator | 11641 | 2.5% |
| Other Punctuation | 457 | 0.1% |
| Open Punctuation | 181 | < 0.1% |
| Close Punctuation | 177 | < 0.1% |
| Math Symbol | 50 | < 0.1% |
| Control | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 31738 | |
| P | 17213 | 10.3% |
| R | 11139 | 6.7% |
| C | 10766 | 6.5% |
| B | 10571 | 6.3% |
| T | 8969 | 5.4% |
| S | 8853 | 5.3% |
| E | 7887 | 4.7% |
| I | 6869 | 4.1% |
| M | 6425 | 3.9% |
| Other values (16) | 46334 |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1806 | |
| e | 1649 | |
| r | 1497 | |
| i | 1305 | 8.9% |
| t | 1184 | 8.0% |
| o | 1061 | 7.2% |
| n | 998 | 6.8% |
| l | 709 | 4.8% |
| s | 662 | 4.5% |
| c | 489 | 3.3% |
| Other values (16) | 3363 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 252 | |
| . | 150 | |
| " | 18 | 3.9% |
| ' | 17 | 3.7% |
| # | 6 | 1.3% |
| , | 6 | 1.3% |
| & | 4 | 0.9% |
| ; | 1 | 0.2% |
| \ | 1 | 0.2% |
| : | 1 | 0.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 45859 | |
| 2 | 45323 | |
| 0 | 33868 | |
| 5 | 19580 | |
| 8 | 18527 | |
| 3 | 18109 | 8.1% |
| 7 | 17205 | 7.7% |
| 4 | 10970 | 4.9% |
| 6 | 10949 | 4.9% |
| 9 | 3591 | 1.6% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 180 | |
| [ | 1 | 0.6% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 176 | |
| ] | 1 | 0.6% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 48 | |
| = | 2 | 4.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 43474 |
Space Separator
| Value | Count | Frequency (%) |
| 11641 |
Control
| Value | Count | Frequency (%) |
| | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 279962 | |
| Latin | 181487 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 31738 | |
| P | 17213 | 9.5% |
| R | 11139 | 6.1% |
| C | 10766 | 5.9% |
| B | 10571 | 5.8% |
| T | 8969 | 4.9% |
| S | 8853 | 4.9% |
| E | 7887 | 4.3% |
| I | 6869 | 3.8% |
| M | 6425 | 3.5% |
| Other values (42) | 61057 |
Common
| Value | Count | Frequency (%) |
| 1 | 45859 | |
| 2 | 45323 | |
| - | 43474 | |
| 0 | 33868 | |
| 5 | 19580 | |
| 8 | 18527 | |
| 3 | 18109 | 6.5% |
| 7 | 17205 | 6.1% |
| 11641 | 4.2% | |
| 4 | 10970 | 3.9% |
| Other values (20) | 15406 | 5.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 461449 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 45859 | 9.9% |
| 2 | 45323 | 9.8% |
| - | 43474 | 9.4% |
| 0 | 33868 | 7.3% |
| A | 31738 | 6.9% |
| 5 | 19580 | 4.2% |
| 8 | 18527 | 4.0% |
| 3 | 18109 | 3.9% |
| P | 17213 | 3.7% |
| 7 | 17205 | 3.7% |
| Other values (72) | 170553 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 572 |
| Missing (%) | 0.7% |
| Memory size | 155.0 KiB |
| False | |
|---|---|
| True | |
| (Missing) | 572 |
| Value | Count | Frequency (%) |
| False | 71105 | |
| True | 7616 | 9.6% |
| (Missing) | 572 | 0.7% |
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3986 |
| Missing (%) | 5.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.146042201 |
| Minimum | 0 |
|---|---|
| Maximum | 18 |
| Zeros | 1275 |
| Zeros (%) | 1.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 619.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 18 |
| Range | 18 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.4559854315 |
|---|---|
| Coefficient of variation (CV) | 0.3978783951 |
| Kurtosis | 34.30191777 |
| Mean | 1.146042201 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.02838621 |
| Sum | 86305 |
| Variance | 0.2079227137 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 63082 | |
| 2 | 10057 | 12.7% |
| 0 | 1275 | 1.6% |
| 3 | 477 | 0.6% |
| 4 | 415 | 0.5% |
| 18 | 1 | < 0.1% |
| (Missing) | 3986 | 5.0% |
| Value | Count | Frequency (%) |
| 0 | 1275 | 1.6% |
| 1 | 63082 | |
| 2 | 10057 | 12.7% |
| 3 | 477 | 0.6% |
| 4 | 415 | 0.5% |
| 18 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 18 | 1 | < 0.1% |
| 4 | 415 | 0.5% |
| 3 | 477 | 0.6% |
| 2 | 10057 | 12.7% |
| 1 | 63082 | |
| 0 | 1275 | 1.6% |
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3374 |
| Missing (%) | 4.3% |
| Memory size | 619.6 KiB |
| Reciprocating | |
|---|---|
| Turbo Shaft | 3305 |
| Turbo Prop | 3042 |
| Turbo Fan | 2226 |
| Unknown | 2052 |
| Other values (9) | 696 |
Length
| Max length | 16 |
|---|---|
| Median length | 13 |
| Mean length | 12.47633662 |
| Min length | 4 |
Characters and Unicode
| Total characters | 947191 |
|---|---|
| Distinct characters | 33 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Reciprocating |
|---|---|
| 2nd row | Reciprocating |
| 3rd row | Reciprocating |
| 4th row | Reciprocating |
| 5th row | Reciprocating |
Common Values
| Value | Count | Frequency (%) |
| Reciprocating | 64598 | |
| Turbo Shaft | 3305 | 4.2% |
| Turbo Prop | 3042 | 3.8% |
| Turbo Fan | 2226 | 2.8% |
| Unknown | 2052 | 2.6% |
| Turbo Jet | 678 | 0.9% |
| None | 6 | < 0.1% |
| Electric | 3 | < 0.1% |
| TF, TJ | 3 | < 0.1% |
| REC, TJ, TJ | 2 | < 0.1% |
| Other values (4) | 4 | < 0.1% |
| (Missing) | 3374 | 4.3% |
Length
| Value | Count | Frequency (%) |
| reciprocating | 64598 | |
| turbo | 9251 | 10.9% |
| shaft | 3305 | 3.9% |
| prop | 3042 | 3.6% |
| fan | 2226 | 2.6% |
| unknown | 2052 | 2.4% |
| jet | 678 | 0.8% |
| tj | 11 | < 0.1% |
| rec | 7 | < 0.1% |
| none | 6 | < 0.1% |
| Other values (5) | 9 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 129203 | |
| i | 129200 | |
| o | 78950 | |
| r | 76895 | |
| n | 72986 | |
| a | 70129 | |
| t | 68585 | |
| p | 67640 | |
| e | 65286 | |
| R | 64606 | |
| Other values (23) | 123711 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 852695 | |
| Uppercase Letter | 85216 | 9.0% |
| Space Separator | 9266 | 1.0% |
| Other Punctuation | 14 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 129203 | |
| i | 129200 | |
| o | 78950 | |
| r | 76895 | |
| n | 72986 | |
| a | 70129 | |
| t | 68585 | |
| p | 67640 | |
| e | 65286 | |
| g | 64598 | |
| Other values (9) | 29223 | 3.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 64606 | |
| T | 9265 | 10.9% |
| S | 3305 | 3.9% |
| P | 3042 | 3.6% |
| F | 2229 | 2.6% |
| U | 2052 | 2.4% |
| J | 689 | 0.8% |
| E | 12 | < 0.1% |
| C | 8 | < 0.1% |
| N | 6 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 9266 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 14 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 937911 | |
| Common | 9280 | 1.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| c | 129203 | |
| i | 129200 | |
| o | 78950 | |
| r | 76895 | |
| n | 72986 | |
| a | 70129 | |
| t | 68585 | |
| p | 67640 | |
| e | 65286 | |
| R | 64606 | |
| Other values (21) | 114431 |
Common
| Value | Count | Frequency (%) |
| 9266 | ||
| , | 14 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 947191 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| c | 129203 | |
| i | 129200 | |
| o | 78950 | |
| r | 76895 | |
| n | 72986 | |
| a | 70129 | |
| t | 68585 | |
| p | 67640 | |
| e | 65286 | |
| R | 64606 | |
| Other values (23) | 123711 |
total_fatal_injuries
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSINGSKEWEDZEROS| Distinct | 122 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 23309 |
| Missing (%) | 29.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.8146791941 |
| Minimum | 0 |
|---|---|
| Maximum | 349 |
| Zeros | 40092 |
| Zeros (%) | 50.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 619.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 3 |
| Maximum | 349 |
| Range | 349 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 6.23370003 |
|---|---|
| Coefficient of variation (CV) | 7.651723618 |
| Kurtosis | 1082.130068 |
| Mean | 0.8146791941 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 29.51903889 |
| Sum | 45609 |
| Variance | 38.85901607 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 40092 | |
| 1 | 7847 | 9.9% |
| 2 | 4619 | 5.8% |
| 3 | 1451 | 1.8% |
| 4 | 1012 | 1.3% |
| 5 | 311 | 0.4% |
| 6 | 196 | 0.2% |
| 7 | 83 | 0.1% |
| 8 | 65 | 0.1% |
| 10 | 42 | 0.1% |
| Other values (112) | 266 | 0.3% |
| (Missing) | 23309 |
| Value | Count | Frequency (%) |
| 0 | 40092 | |
| 1 | 7847 | 9.9% |
| 2 | 4619 | 5.8% |
| 3 | 1451 | 1.8% |
| 4 | 1012 | 1.3% |
| 5 | 311 | 0.4% |
| 6 | 196 | 0.2% |
| 7 | 83 | 0.1% |
| 8 | 65 | 0.1% |
| 9 | 36 | < 0.1% |
| Value | Count | Frequency (%) |
| 349 | 2 | |
| 295 | 1 | |
| 270 | 1 | |
| 265 | 1 | |
| 256 | 1 | |
| 239 | 1 | |
| 230 | 1 | |
| 229 | 1 | |
| 228 | 2 | |
| 224 | 1 |
| Distinct | 40 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 25551 |
| Missing (%) | 32.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.3177031 |
| Minimum | 0 |
|---|---|
| Maximum | 111 |
| Zeros | 42660 |
| Zeros (%) | 53.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 619.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 111 |
| Range | 111 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.372924237 |
|---|---|
| Coefficient of variation (CV) | 4.321406485 |
| Kurtosis | 2213.530689 |
| Mean | 0.3177031 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 37.45361961 |
| Sum | 17074 |
| Variance | 1.884920959 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 42660 | |
| 1 | 8016 | 10.1% |
| 2 | 2177 | 2.7% |
| 3 | 502 | 0.6% |
| 4 | 205 | 0.3% |
| 5 | 65 | 0.1% |
| 6 | 25 | < 0.1% |
| 7 | 22 | < 0.1% |
| 8 | 7 | < 0.1% |
| 10 | 7 | < 0.1% |
| Other values (30) | 56 | 0.1% |
| (Missing) | 25551 |
| Value | Count | Frequency (%) |
| 0 | 42660 | |
| 1 | 8016 | 10.1% |
| 2 | 2177 | 2.7% |
| 3 | 502 | 0.6% |
| 4 | 205 | 0.3% |
| 5 | 65 | 0.1% |
| 6 | 25 | < 0.1% |
| 7 | 22 | < 0.1% |
| 8 | 7 | < 0.1% |
| 9 | 7 | < 0.1% |
| Value | Count | Frequency (%) |
| 111 | 1 | < 0.1% |
| 106 | 1 | < 0.1% |
| 81 | 1 | < 0.1% |
| 66 | 1 | < 0.1% |
| 60 | 1 | < 0.1% |
| 59 | 2 | |
| 55 | 1 | < 0.1% |
| 50 | 3 | |
| 47 | 1 | < 0.1% |
| 45 | 1 | < 0.1% |
total_minor_injuries
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSINGSKEWEDZEROS| Distinct | 62 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 24460 |
| Missing (%) | 30.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5025805628 |
| Minimum | 0 |
|---|---|
| Maximum | 380 |
| Zeros | 40064 |
| Zeros (%) | 50.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 619.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 380 |
| Range | 380 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 2.781994444 |
|---|---|
| Coefficient of variation (CV) | 5.53541989 |
| Kurtosis | 7360.610457 |
| Mean | 0.5025805628 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 66.84890696 |
| Sum | 27558 |
| Variance | 7.739493085 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 40064 | |
| 1 | 9519 | 12.0% |
| 2 | 3575 | 4.5% |
| 3 | 799 | 1.0% |
| 4 | 389 | 0.5% |
| 5 | 136 | 0.2% |
| 6 | 72 | 0.1% |
| 7 | 58 | 0.1% |
| 9 | 28 | < 0.1% |
| 8 | 23 | < 0.1% |
| Other values (52) | 170 | 0.2% |
| (Missing) | 24460 |
| Value | Count | Frequency (%) |
| 0 | 40064 | |
| 1 | 9519 | 12.0% |
| 2 | 3575 | 4.5% |
| 3 | 799 | 1.0% |
| 4 | 389 | 0.5% |
| 5 | 136 | 0.2% |
| 6 | 72 | 0.1% |
| 7 | 58 | 0.1% |
| 8 | 23 | < 0.1% |
| 9 | 28 | < 0.1% |
| Value | Count | Frequency (%) |
| 380 | 1 | |
| 200 | 1 | |
| 171 | 1 | |
| 137 | 1 | |
| 125 | 1 | |
| 96 | 1 | |
| 88 | 1 | |
| 84 | 1 | |
| 71 | 1 | |
| 69 | 1 |
total_uninjured
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSINGZEROS| Distinct | 364 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 12344 |
| Missing (%) | 15.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.790885599 |
| Minimum | 0 |
|---|---|
| Maximum | 699 |
| Zeros | 19126 |
| Zeros (%) | 24.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 619.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 6 |
| Maximum | 699 |
| Range | 699 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 29.22301628 |
|---|---|
| Coefficient of variation (CV) | 5.046381211 |
| Kurtosis | 101.5231507 |
| Mean | 5.790885599 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 8.950895422 |
| Sum | 387694 |
| Variance | 853.9846806 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 22583 | |
| 0 | 19126 | |
| 2 | 14316 | |
| 3 | 3949 | 5.0% |
| 4 | 2508 | 3.2% |
| 5 | 817 | 1.0% |
| 6 | 449 | 0.6% |
| 7 | 253 | 0.3% |
| 8 | 141 | 0.2% |
| 9 | 111 | 0.1% |
| Other values (354) | 2696 | 3.4% |
| (Missing) | 12344 |
| Value | Count | Frequency (%) |
| 0 | 19126 | |
| 1 | 22583 | |
| 2 | 14316 | |
| 3 | 3949 | 5.0% |
| 4 | 2508 | 3.2% |
| 5 | 817 | 1.0% |
| 6 | 449 | 0.6% |
| 7 | 253 | 0.3% |
| 8 | 141 | 0.2% |
| 9 | 111 | 0.1% |
| Value | Count | Frequency (%) |
| 699 | 2 | |
| 588 | 2 | |
| 576 | 2 | |
| 573 | 2 | |
| 558 | 1 | |
| 528 | 2 | |
| 507 | 1 | |
| 501 | 2 | |
| 495 | 2 | |
| 461 | 2 |
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 619.6 KiB |
| VMC | |
|---|---|
| IMC | 5660 |
| UNK | 3126 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 237879 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | VMC |
|---|---|
| 2nd row | VMC |
| 3rd row | VMC |
| 4th row | VMC |
| 5th row | VMC |
Common Values
| Value | Count | Frequency (%) |
| VMC | 70507 | |
| IMC | 5660 | 7.1% |
| UNK | 3126 | 3.9% |
Length
Pie chart
| Value | Count | Frequency (%) |
| vmc | 70507 | |
| imc | 5660 | 7.1% |
| unk | 3126 | 3.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 76167 | |
| C | 76167 | |
| V | 70507 | |
| I | 5660 | 2.4% |
| U | 3126 | 1.3% |
| N | 3126 | 1.3% |
| K | 3126 | 1.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 237879 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 76167 | |
| C | 76167 | |
| V | 70507 | |
| I | 5660 | 2.4% |
| U | 3126 | 1.3% |
| N | 3126 | 1.3% |
| K | 3126 | 1.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 237879 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 76167 | |
| C | 76167 | |
| V | 70507 | |
| I | 5660 | 2.4% |
| U | 3126 | 1.3% |
| N | 3126 | 1.3% |
| K | 3126 | 1.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 237879 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| M | 76167 | |
| C | 76167 | |
| V | 70507 | |
| I | 5660 | 2.4% |
| U | 3126 | 1.3% |
| N | 3126 | 1.3% |
| K | 3126 | 1.3% |
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 6054 |
| Missing (%) | 7.6% |
| Memory size | 619.6 KiB |
| LANDING | |
|---|---|
| TAKEOFF | |
| CRUISE | |
| MANEUVERING | |
| APPROACH | |
| Other values (7) |
Length
| Max length | 11 |
|---|---|
| Median length | 7 |
| Mean length | 7.393779271 |
| Min length | 4 |
Characters and Unicode
| Total characters | 541513 |
|---|---|
| Distinct characters | 23 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | CLIMB |
|---|---|
| 2nd row | CRUISE |
| 3rd row | APPROACH |
| 4th row | MANEUVERING |
| 5th row | LANDING |
Common Values
| Value | Count | Frequency (%) |
| LANDING | 19209 | |
| TAKEOFF | 15284 | |
| CRUISE | 10749 | |
| MANEUVERING | 9818 | |
| APPROACH | 7720 | |
| TAXI | 2322 | 2.9% |
| CLIMB | 2279 | 2.9% |
| DESCENT | 2202 | 2.8% |
| GO-AROUND | 1608 | 2.0% |
| STANDING | 1219 | 1.5% |
| Other values (2) | 829 | 1.0% |
| (Missing) | 6054 | 7.6% |
Length
| Value | Count | Frequency (%) |
| landing | 19209 | |
| takeoff | 15284 | |
| cruise | 10749 | |
| maneuvering | 9818 | |
| approach | 7720 | |
| taxi | 2322 | 3.2% |
| climb | 2279 | 3.1% |
| descent | 2202 | 3.0% |
| go-around | 1608 | 2.2% |
| standing | 1219 | 1.7% |
| Other values (2) | 829 | 1.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 66318 | |
| A | 64900 | |
| E | 50230 | 9.3% |
| I | 45596 | 8.4% |
| G | 31854 | 5.9% |
| F | 30568 | 5.6% |
| R | 30052 | 5.5% |
| O | 27049 | 5.0% |
| D | 24238 | 4.5% |
| C | 22950 | 4.2% |
| Other values (13) | 147758 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 539905 | |
| Dash Punctuation | 1608 | 0.3% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 66318 | |
| A | 64900 | |
| E | 50230 | 9.3% |
| I | 45596 | 8.4% |
| G | 31854 | 5.9% |
| F | 30568 | 5.7% |
| R | 30052 | 5.6% |
| O | 27049 | 5.0% |
| D | 24238 | 4.5% |
| C | 22950 | 4.3% |
| Other values (12) | 146150 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1608 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 539905 | |
| Common | 1608 | 0.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 66318 | |
| A | 64900 | |
| E | 50230 | 9.3% |
| I | 45596 | 8.4% |
| G | 31854 | 5.9% |
| F | 30568 | 5.7% |
| R | 30052 | 5.6% |
| O | 27049 | 5.0% |
| D | 24238 | 4.5% |
| C | 22950 | 4.3% |
| Other values (12) | 146150 |
Common
| Value | Count | Frequency (%) |
| - | 1608 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 541513 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 66318 | |
| A | 64900 | |
| E | 50230 | 9.3% |
| I | 45596 | 8.4% |
| G | 31854 | 5.9% |
| F | 30568 | 5.6% |
| R | 30052 | 5.5% |
| O | 27049 | 5.0% |
| D | 24238 | 4.5% |
| C | 22950 | 4.2% |
| Other values (13) | 147758 |
| Distinct | 42 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1996.749827 |
| Minimum | 1948 |
|---|---|
| Maximum | 2017 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 619.6 KiB |
Quantile statistics
| Minimum | 1948 |
|---|---|
| 5-th percentile | 1983 |
| Q1 | 1988 |
| median | 1996 |
| Q3 | 2005 |
| 95-th percentile | 2014 |
| Maximum | 2017 |
| Range | 69 |
| Interquartile range (IQR) | 17 |
Descriptive statistics
| Standard deviation | 10.11297874 |
|---|---|
| Coefficient of variation (CV) | 0.005064719981 |
| Kurtosis | -1.155373615 |
| Mean | 1996.749827 |
| Median Absolute Deviation (MAD) | 9 |
| Skewness | 0.2332184642 |
| Sum | 158328284 |
| Variance | 102.2723391 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1982 | 3593 | 4.5% |
| 1983 | 3556 | 4.5% |
| 1984 | 3457 | 4.4% |
| 1985 | 3096 | 3.9% |
| 1986 | 2880 | 3.6% |
| 1987 | 2828 | 3.6% |
| 1988 | 2730 | 3.4% |
| 1989 | 2544 | 3.2% |
| 1990 | 2518 | 3.2% |
| 1991 | 2462 | 3.1% |
| Other values (32) | 49629 |
| Value | Count | Frequency (%) |
| 1948 | 1 | < 0.1% |
| 1962 | 1 | < 0.1% |
| 1974 | 1 | < 0.1% |
| 1977 | 1 | < 0.1% |
| 1979 | 1 | < 0.1% |
| 1981 | 1 | < 0.1% |
| 1982 | 3593 | |
| 1983 | 3556 | |
| 1984 | 3457 | |
| 1985 | 3096 |
| Value | Count | Frequency (%) |
| 2017 | 1 | < 0.1% |
| 2016 | 1409 | |
| 2015 | 1578 | |
| 2014 | 1539 | |
| 2013 | 1555 | |
| 2012 | 1860 | |
| 2011 | 1886 | |
| 2010 | 1818 | |
| 2009 | 1805 | |
| 2008 | 1931 |
Month
Real number (ℝ≥0)
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.582787888 |
| Minimum | 1 |
|---|---|
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 619.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 4 |
| median | 7 |
| Q3 | 9 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 3.060089058 |
|---|---|
| Coefficient of variation (CV) | 0.4648621694 |
| Kurtosis | -0.9007937113 |
| Mean | 6.582787888 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -0.05644265097 |
| Sum | 521969 |
| Variance | 9.364145045 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7 | 9504 | |
| 8 | 8983 | |
| 6 | 8544 | |
| 5 | 7626 | |
| 9 | 7382 | |
| 4 | 6549 | |
| 10 | 6165 | |
| 3 | 5994 | |
| 11 | 4907 | |
| 2 | 4680 | |
| Other values (2) | 8959 |
| Value | Count | Frequency (%) |
| 1 | 4448 | |
| 2 | 4680 | |
| 3 | 5994 | |
| 4 | 6549 | |
| 5 | 7626 | |
| 6 | 8544 | |
| 7 | 9504 | |
| 8 | 8983 | |
| 9 | 7382 | |
| 10 | 6165 |
| Value | Count | Frequency (%) |
| 12 | 4511 | |
| 11 | 4907 | |
| 10 | 6165 | |
| 9 | 7382 | |
| 8 | 8983 | |
| 7 | 9504 | |
| 6 | 8544 | |
| 5 | 7626 | |
| 4 | 6549 | |
| 3 | 5994 |
Day
Real number (ℝ≥0)
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.71744038 |
| Minimum | 1 |
|---|---|
| Maximum | 31 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 619.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 8 |
| median | 16 |
| Q3 | 23 |
| 95-th percentile | 30 |
| Maximum | 31 |
| Range | 30 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 8.831329784 |
|---|---|
| Coefficient of variation (CV) | 0.5618809152 |
| Kurtosis | -1.195828365 |
| Mean | 15.71744038 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.006435342317 |
| Sum | 1246283 |
| Variance | 77.99238576 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 2723 | 3.4% |
| 19 | 2685 | 3.4% |
| 2 | 2675 | 3.4% |
| 16 | 2667 | 3.4% |
| 6 | 2661 | 3.4% |
| 18 | 2646 | 3.3% |
| 17 | 2639 | 3.3% |
| 5 | 2638 | 3.3% |
| 23 | 2635 | 3.3% |
| 8 | 2633 | 3.3% |
| Other values (21) | 52691 |
| Value | Count | Frequency (%) |
| 1 | 2723 | |
| 2 | 2675 | |
| 3 | 2541 | |
| 4 | 2599 | |
| 5 | 2638 | |
| 6 | 2661 | |
| 7 | 2600 | |
| 8 | 2633 | |
| 9 | 2554 | |
| 10 | 2597 |
| Value | Count | Frequency (%) |
| 31 | 1616 | |
| 30 | 2392 | |
| 29 | 2409 | |
| 28 | 2628 | |
| 27 | 2623 | |
| 26 | 2606 | |
| 25 | 2541 | |
| 24 | 2520 | |
| 23 | 2635 | |
| 22 | 2520 |
injuries
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSINGSKEWEDZEROS| Distinct | 97 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 29557 |
| Missing (%) | 37.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.071155702 |
| Minimum | 0 |
|---|---|
| Maximum | 283 |
| Zeros | 26956 |
| Zeros (%) | 34.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 619.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 4 |
| Maximum | 283 |
| Range | 283 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 5.048778214 |
|---|---|
| Coefficient of variation (CV) | 4.713393398 |
| Kurtosis | 1208.600025 |
| Mean | 1.071155702 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 30.77744311 |
| Sum | 53275 |
| Variance | 25.49016146 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 26956 | |
| 1 | 11453 | 14.4% |
| 2 | 6910 | 8.7% |
| 3 | 1880 | 2.4% |
| 4 | 1384 | 1.7% |
| 5 | 395 | 0.5% |
| 6 | 257 | 0.3% |
| 7 | 101 | 0.1% |
| 8 | 79 | 0.1% |
| 10 | 39 | < 0.1% |
| Other values (87) | 282 | 0.4% |
| (Missing) | 29557 |
| Value | Count | Frequency (%) |
| 0 | 26956 | |
| 1 | 11453 | |
| 2 | 6910 | 8.7% |
| 3 | 1880 | 2.4% |
| 4 | 1384 | 1.7% |
| 5 | 395 | 0.5% |
| 6 | 257 | 0.3% |
| 7 | 101 | 0.1% |
| 8 | 79 | 0.1% |
| 9 | 36 | < 0.1% |
| Value | Count | Frequency (%) |
| 283 | 1 | |
| 275 | 1 | |
| 256 | 1 | |
| 231 | 1 | |
| 230 | 1 | |
| 229 | 1 | |
| 190 | 2 | |
| 189 | 1 | |
| 174 | 1 | |
| 171 | 1 |
pax_onboard
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSING| Distinct | 322 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 29671 |
| Missing (%) | 37.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.286304462 |
| Minimum | 0 |
|---|---|
| Maximum | 528 |
| Zeros | 27 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 619.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 2 |
| 95-th percentile | 6 |
| Maximum | 528 |
| Range | 528 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 24.70287163 |
|---|---|
| Coefficient of variation (CV) | 4.672994492 |
| Kurtosis | 128.5555772 |
| Mean | 5.286304462 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 10.25386491 |
| Sum | 262317 |
| Variance | 610.231867 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 21780 | |
| 2 | 15873 | |
| 3 | 4687 | 5.9% |
| 4 | 3393 | 4.3% |
| 5 | 1043 | 1.3% |
| 6 | 597 | 0.8% |
| 7 | 263 | 0.3% |
| 8 | 168 | 0.2% |
| 9 | 91 | 0.1% |
| 10 | 89 | 0.1% |
| Other values (312) | 1638 | 2.1% |
| (Missing) | 29671 |
| Value | Count | Frequency (%) |
| 0 | 27 | < 0.1% |
| 1 | 21780 | |
| 2 | 15873 | |
| 3 | 4687 | 5.9% |
| 4 | 3393 | 4.3% |
| 5 | 1043 | 1.3% |
| 6 | 597 | 0.8% |
| 7 | 263 | 0.3% |
| 8 | 168 | 0.2% |
| 9 | 91 | 0.1% |
| Value | Count | Frequency (%) |
| 528 | 2 | |
| 507 | 1 | |
| 496 | 2 | |
| 481 | 1 | |
| 468 | 1 | |
| 461 | 2 | |
| 459 | 2 | |
| 441 | 2 | |
| 440 | 2 | |
| 436 | 2 |
fatality_percentage
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSINGZEROS| Distinct | 114 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 29698 |
| Missing (%) | 37.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 17.37817274 |
| Minimum | 0 |
|---|---|
| Maximum | 100 |
| Zeros | 40061 |
| Zeros (%) | 50.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 619.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 100 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 36.86760591 |
|---|---|
| Coefficient of variation (CV) | 2.121489207 |
| Kurtosis | 1.055932869 |
| Mean | 17.37817274 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.722139158 |
| Sum | 861870.477 |
| Variance | 1359.220365 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 40061 | |
| 100 | 7802 | 9.8% |
| 50 | 861 | 1.1% |
| 33.33333333 | 209 | 0.3% |
| 66.66666667 | 157 | 0.2% |
| 25 | 109 | 0.1% |
| 75 | 87 | 0.1% |
| 20 | 44 | 0.1% |
| 40 | 30 | < 0.1% |
| 60 | 25 | < 0.1% |
| Other values (104) | 210 | 0.3% |
| (Missing) | 29698 |
| Value | Count | Frequency (%) |
| 0 | 40061 | |
| 0.2544529262 | 1 | < 0.1% |
| 0.2985074627 | 1 | < 0.1% |
| 0.3584229391 | 1 | < 0.1% |
| 0.4081632653 | 1 | < 0.1% |
| 0.4926108374 | 1 | < 0.1% |
| 0.6289308176 | 2 | < 0.1% |
| 0.6666666667 | 1 | < 0.1% |
| 0.7407407407 | 1 | < 0.1% |
| 0.7518796992 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 100 | 7802 | |
| 98.5915493 | 1 | < 0.1% |
| 98.18181818 | 1 | < 0.1% |
| 98.11320755 | 1 | < 0.1% |
| 97.56097561 | 1 | < 0.1% |
| 96.2962963 | 1 | < 0.1% |
| 94.95798319 | 1 | < 0.1% |
| 93.50649351 | 1 | < 0.1% |
| 91.11111111 | 2 | < 0.1% |
| 90.625 | 1 | < 0.1% |
survived
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONMISSINGZEROS| Distinct | 319 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 29671 |
| Missing (%) | 37.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.813913184 |
| Minimum | 0 |
|---|---|
| Maximum | 528 |
| Zeros | 7829 |
| Zeros (%) | 9.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 619.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 5 |
| Maximum | 528 |
| Range | 528 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 24.4240241 |
|---|---|
| Coefficient of variation (CV) | 5.073632026 |
| Kurtosis | 133.225737 |
| Mean | 4.813913184 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 10.44181691 |
| Sum | 238876 |
| Variance | 596.5329533 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 19183 | |
| 2 | 13004 | |
| 0 | 7829 | 9.9% |
| 3 | 3771 | 4.8% |
| 4 | 2578 | 3.3% |
| 5 | 812 | 1.0% |
| 6 | 447 | 0.6% |
| 7 | 224 | 0.3% |
| 8 | 117 | 0.1% |
| 9 | 71 | 0.1% |
| Other values (309) | 1586 | 2.0% |
| (Missing) | 29671 |
| Value | Count | Frequency (%) |
| 0 | 7829 | |
| 1 | 19183 | |
| 2 | 13004 | |
| 3 | 3771 | 4.8% |
| 4 | 2578 | 3.3% |
| 5 | 812 | 1.0% |
| 6 | 447 | 0.6% |
| 7 | 224 | 0.3% |
| 8 | 117 | 0.1% |
| 9 | 71 | 0.1% |
| Value | Count | Frequency (%) |
| 528 | 2 | |
| 507 | 1 | |
| 496 | 2 | |
| 481 | 1 | |
| 468 | 1 | |
| 461 | 2 | |
| 459 | 2 | |
| 441 | 2 | |
| 440 | 2 | |
| 436 | 2 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 77.6 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 71647 | |
| True | 7646 | 9.6% |
df_index
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONUNIFORMUNIQUE| Distinct | 79293 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 39646 |
| Minimum | 0 |
|---|---|
| Maximum | 79292 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 619.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3964.6 |
| Q1 | 19823 |
| median | 39646 |
| Q3 | 59469 |
| 95-th percentile | 75327.4 |
| Maximum | 79292 |
| Range | 79292 |
| Interquartile range (IQR) | 39646 |
Descriptive statistics
| Standard deviation | 22890.06178 |
|---|---|
| Coefficient of variation (CV) | 0.5773611912 |
| Kurtosis | -1.2 |
| Mean | 39646 |
| Median Absolute Deviation (MAD) | 19823 |
| Skewness | -2.103152104 × 10-17 |
| Sum | 3143650278 |
| Variance | 523954928.5 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2047 | 1 | < 0.1% |
| 4759 | 1 | < 0.1% |
| 10896 | 1 | < 0.1% |
| 8849 | 1 | < 0.1% |
| 14994 | 1 | < 0.1% |
| 12947 | 1 | < 0.1% |
| 2708 | 1 | < 0.1% |
| 661 | 1 | < 0.1% |
| 6806 | 1 | < 0.1% |
| 27288 | 1 | < 0.1% |
| Other values (79283) | 79283 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 |
| Value | Count | Frequency (%) |
| 79292 | 1 | |
| 79291 | 1 | |
| 79290 | 1 | |
| 79289 | 1 | |
| 79288 | 1 | |
| 79287 | 1 | |
| 79286 | 1 | |
| 79285 | 1 | |
| 79284 | 1 | |
| 79283 | 1 |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| Unnamed: 0 | event_date | location | injury_severity | aircraft_damage | make | model | amateur_build | number_of_engines | engine_type | total_fatal_injuries | total_serious_injuries | total_minor_injuries | total_uninjured | weather_conditions | phase_of_flight | Year | Month | Day | injuries | pax_onboard | fatality_percentage | survived | AmateurBuilt | df_index | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 77738 | 1982-06-13 | CAMBRIA, NY | Non-Fatal | Destroyed | NaN | 64 | No | 1.0 | Reciprocating | 0.0 | 1.0 | 0.0 | 0.0 | VMC | CLIMB | 1982 | 6 | 13 | 1.0 | 1.0 | 0.0 | 1.0 | No | 77738 |
| 1 | 77526 | 1982-07-01 | MCWHORTER, KY | Non-Fatal | Destroyed | NaN | KR-2 | Yes | 1.0 | Reciprocating | 0.0 | 1.0 | 1.0 | 0.0 | VMC | CRUISE | 1982 | 7 | 1 | 2.0 | 2.0 | 0.0 | 2.0 | Yes | 77526 |
| 2 | 77325 | 1982-07-16 | FREDERICK, MD | Fatal(1) | Destroyed | NaN | WINDWAGON | Yes | 1.0 | Reciprocating | 1.0 | 0.0 | 0.0 | 0.0 | VMC | APPROACH | 1982 | 7 | 16 | 1.0 | 1.0 | 100.0 | 0.0 | Yes | 77325 |
| 3 | 76825 | 1982-08-21 | VENTURA, CA | Non-Fatal | Destroyed | NaN | MIDGET MUSTANG | Yes | 1.0 | Reciprocating | 0.0 | 0.0 | 0.0 | 1.0 | VMC | MANEUVERING | 1982 | 8 | 21 | 0.0 | 1.0 | 0.0 | 1.0 | Yes | 76825 |
| 4 | 76786 | 1982-08-24 | SIDNEY, NE | Non-Fatal | Substantial | NaN | SKYBOLT | Yes | 1.0 | Reciprocating | 0.0 | 0.0 | 0.0 | 1.0 | VMC | LANDING | 1982 | 8 | 24 | 0.0 | 1.0 | 0.0 | 1.0 | Yes | 76786 |
| 5 | 76559 | 1982-09-11 | PLATTSBURG, MO | Non-Fatal | Substantial | NaN | STARDUSTER TOO | Yes | 1.0 | Reciprocating | 0.0 | 2.0 | 0.0 | 0.0 | VMC | TAKEOFF | 1982 | 9 | 11 | 2.0 | 2.0 | 0.0 | 2.0 | Yes | 76559 |
| 6 | 76182 | 1982-10-23 | ELOY, AZ | Fatal(1) | Substantial | NaN | HOBBS B8M | Yes | 1.0 | Reciprocating | 1.0 | 0.0 | 0.0 | 0.0 | VMC | APPROACH | 1982 | 10 | 23 | 1.0 | 1.0 | 100.0 | 0.0 | Yes | 76182 |
| 7 | 52359 | 1990-11-02 | IOWA PARK, TX | Non-Fatal | Substantial | NaN | RANS S-9 | Yes | 1.0 | Reciprocating | 0.0 | 0.0 | 0.0 | 1.0 | VMC | TAKEOFF | 1990 | 11 | 2 | 0.0 | 1.0 | 0.0 | 1.0 | Yes | 52359 |
| 8 | 33935 | 1998-12-05 | MANILLA, Philippines | Unavailable | NaN | NaN | A330 | No | NaN | Unknown | NaN | NaN | NaN | NaN | UNK | NaN | 1998 | 12 | 5 | NaN | NaN | NaN | NaN | No | 33935 |
| 9 | 29539 | 2000-11-28 | Nairobi, Kenya | Incident | Minor | NaN | NaN | No | NaN | NaN | NaN | NaN | NaN | NaN | UNK | NaN | 2000 | 11 | 28 | NaN | NaN | NaN | NaN | No | 29539 |
Last rows
| Unnamed: 0 | event_date | location | injury_severity | aircraft_damage | make | model | amateur_build | number_of_engines | engine_type | total_fatal_injuries | total_serious_injuries | total_minor_injuries | total_uninjured | weather_conditions | phase_of_flight | Year | Month | Day | injuries | pax_onboard | fatality_percentage | survived | AmateurBuilt | df_index | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 79283 | 10469 | 2010-08-17 | Toledo, Spain | Fatal(1) | Destroyed | ZIVKO AERONAUTICS INC | EDGE 540 | No | 1.0 | Reciprocating | 1.0 | NaN | NaN | NaN | UNK | NaN | 2010 | 8 | 17 | NaN | NaN | NaN | NaN | No | 10469 |
| 79284 | 785 | 2016-06-16 | KyviÂÂkes, Lithuania | Fatal(1) | Destroyed | ZIVKO AERONAUTICS INC | EDGE 540 | No | NaN | Reciprocating | 1.0 | NaN | NaN | NaN | UNK | NaN | 2016 | 6 | 16 | NaN | NaN | NaN | NaN | No | 785 |
| 79285 | 20095 | 2005-07-30 | Denison, TX | Fatal(1) | Destroyed | ZIVKO AERONAUTICS INC. | Edge 540-T | Yes | 1.0 | Reciprocating | 1.0 | NaN | NaN | NaN | VMC | MANEUVERING | 2005 | 7 | 30 | NaN | NaN | NaN | NaN | Yes | 20095 |
| 79286 | 1243 | 2016-02-27 | Hong Kong, Hong Kong | Fatal(1) | Substantial | ZLIN | Z242L | No | NaN | Reciprocating | 1.0 | NaN | NaN | NaN | UNK | UNKNOWN | 2016 | 2 | 27 | NaN | NaN | NaN | NaN | No | 1243 |
| 79287 | 32877 | 1999-06-22 | GROSSENHAIN, Germany | Fatal(4) | Substantial | ZLIN | Z-42M | No | NaN | NaN | 4.0 | NaN | NaN | NaN | UNK | NaN | 1999 | 6 | 22 | NaN | NaN | NaN | NaN | No | 32877 |
| 79288 | 16118 | 2007-08-15 | Mosquero, NM | Fatal(2) | Substantial | ZLIN AVIATION S.R.O. | Savage | No | 1.0 | Reciprocating | 2.0 | NaN | NaN | NaN | VMC | MANEUVERING | 2007 | 8 | 15 | NaN | NaN | NaN | NaN | No | 16118 |
| 79289 | 22549 | 2004-05-29 | Mazon, IL | Fatal(1) | Substantial | ZORN | EAA Sport Bi-Plane | Yes | 1.0 | Reciprocating | 1.0 | NaN | NaN | NaN | VMC | TAKEOFF | 2004 | 5 | 29 | NaN | NaN | NaN | NaN | Yes | 22549 |
| 79290 | 3767 | 2014-07-06 | Mattituck, NY | Fatal(1) | Substantial | ZUBAIR S KHAN | RAVEN | Yes | 1.0 | Reciprocating | 1.0 | NaN | NaN | NaN | VMC | MANEUVERING | 2014 | 7 | 6 | NaN | NaN | NaN | NaN | Yes | 3767 |
| 79291 | 38361 | 1996-11-23 | WILLIAMSPORT, PA | Non-Fatal | Substantial | ZUKOWSKI | EAA BIPLANE | Yes | 1.0 | Reciprocating | 0.0 | 0.0 | 0.0 | 1.0 | VMC | TAKEOFF | 1996 | 11 | 23 | 0.0 | 1.0 | 0.0 | 1.0 | Yes | 38361 |
| 79292 | 35863 | 1998-02-22 | WEYERS CAVE, VA | Non-Fatal | Substantial | ZWART | KIT FOX VIXEN | Yes | 1.0 | Reciprocating | 0.0 | 0.0 | 0.0 | 2.0 | VMC | DESCENT | 1998 | 2 | 22 | 0.0 | 2.0 | 0.0 | 2.0 | Yes | 35863 |